Improving Region Selection Through Early-Exit Detection

Authors

  • Chun-Chen Hsu
  • Pangfeng Liu
  • Jan-Jan Wu
  • Chien-Min Wang
  • Ding-Yong Hong
  • Wei Chung Hsu

Abstract

Many dynamic binary translation (DBT) systems and just-in-time compilers target traces, i.e., frequently taken execution paths, as the code regions to be translated and optimized. The Next-Executing-Tail (NET) trace selection method used in HP Dynamo is an early example of such techniques, and many current trace optimization schemes are variations of NET. These NET-like trace optimizations work very well for most traces, but they all suffer from the same problem: the selected traces may contain a large number of early exits that branch out in the middle of a trace. If early exits are taken frequently during program execution, the benefit of trace optimization can be lost to the overhead of costly compensation code in the trace epilogue. We refer to traces/regions with frequently taken early exits as delinquent traces/regions. Our empirical study shows that at least 9 of the 12 SPEC CPU2006 integer benchmarks have delinquent traces, i.e., if we use NET to select traces, each of these nine benchmarks takes more than 100 early exits per million executed instructions in its traces. In this paper, we significantly improve the performance of NET by merging delinquent traces into larger code regions. We propose a light-weight region formation technique called Early-Exit Guided region selection (EEG), which improves performance by iteratively detecting delinquent regions and merging them into larger code regions. Hardware-assisted dynamic profiling is first used to identify hot code regions without incurring significant runtime overhead. Software counters are then instrumented at the exit points of the hot regions to detect early exits. When a counter exceeds a certain threshold value, the code region that begins at the branch target of that early exit is merged into the main code region. We also employ a heuristic to decide whether merging the selected regions is beneficial: we do not merge two regions if the cost of spill code is too high for the merged code.

We implement our EEG algorithm in two LLVM-based parallel dynamic binary translators. These two translators target the ARM and IA32 instruction set architectures (ISAs), respectively, and both use multiple compilation threads to compile different code regions concurrently. We evaluate the performance of EEG with two benchmark suites: the SPEC CPU2006 single-threaded benchmark suite with reference inputs, and the PARSEC multi-threaded benchmarks with native inputs. The experimental results show that, compared to NET, EEG achieves a performance improvement of up to 67% (13% on average) for the SPEC CPU2006 integer benchmarks, and up to 20% (10% on average) for the PARSEC multi-threaded benchmarks.
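
The detection and merging steps described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. The `Region` class, the function names, the counter threshold, the spill-cost budget, and the toy spill-cost estimate are all assumptions made for illustration. Only the rate of 100 early exits per million executed instructions comes from the abstract.

```python
from dataclasses import dataclass, field

# Rate quoted in the abstract for delinquent traces.
DELINQUENT_EXITS_PER_MILLION = 100
# Hypothetical values; the abstract does not state the actual thresholds.
MERGE_THRESHOLD = 50
MAX_SPILL_COST = 8


@dataclass
class Region:
    blocks: set                      # entry addresses of the basic blocks in the region
    live_values: int = 0             # crude stand-in for register pressure
    exit_counters: dict = field(default_factory=dict)  # exit target -> times taken


def is_delinquent(early_exits: int, executed_instructions: int) -> bool:
    """True if more than 100 early exits are taken per million instructions."""
    return early_exits * 1_000_000 > DELINQUENT_EXITS_PER_MILLION * executed_instructions


def spill_cost(a: Region, b: Region, num_registers: int = 16) -> int:
    """Toy estimate (assumption): values beyond the register file are spilled."""
    return max(0, a.live_values + b.live_values - num_registers)


def record_early_exit(region: Region, target: int, regions_by_entry: dict) -> bool:
    """Count one early exit to `target`; merge the target region once the
    counter exceeds the threshold and the spill cost is acceptable.
    Returns True if a merge happened on this call."""
    region.exit_counters[target] = region.exit_counters.get(target, 0) + 1
    if region.exit_counters[target] <= MERGE_THRESHOLD:
        return False
    other = regions_by_entry.get(target)
    if other is None or other is region:
        return False
    if spill_cost(region, other) > MAX_SPILL_COST:
        return False                 # merging would cost too much spill code
    region.blocks |= other.blocks
    region.live_values += other.live_values
    del region.exit_counters[target]  # the former exit is now internal
    return True
```

In this sketch a region keeps one counter per exit target, so the merge decision is local to the exit that is actually hot. Repeating the check on every later exit gives the iterative behaviour the abstract describes: a merged region can itself become delinquent and grow again.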





Publication date: 2012